AITopics | apache hadoop


apache hadoop

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

Azhir, Elham, Hosseinzadeh, Mehdi, Khan, Faheem, Mosavi, Amir

arXiv.org Artificial Intelligence Sep-17-2022

Access plan recommendation is a query optimization approach that executes new queries using previously created query execution plans (QEPs). In this method, the query optimizer divides the query space into clusters. However, traditional clustering algorithms take a significant amount of execution time when clustering large query datasets. The MapReduce distributed computing model provides efficient solutions for storing and processing vast quantities of data. Apache Spark and Apache Hadoop frameworks are used in the present investigation to cluster different sizes of query datasets in the MapReduce-based access plan recommendation method. The performance evaluation is based on execution time. The results of the experiments demonstrated the effectiveness of parallel query clustering in achieving high scalability. Furthermore, Apache Spark achieved better performance than Apache Hadoop, reaching an average speedup of 2x.
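
To make the idea of MapReduce-style query clustering concrete, here is a minimal single-machine sketch in plain Python. It is not the paper's implementation: the abstract does not name the clustering algorithm or the query features, so k-means and the 2-D vectors below are assumptions chosen for illustration, and no Hadoop or Spark API is used. The function names `map_phase`, `reduce_phase`, and `kmeans` are hypothetical.

```python
from collections import defaultdict
from math import dist


def map_phase(points, centroids):
    """Map step: emit (index of nearest centroid, point) for every point."""
    for p in points:
        nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
        yield nearest, p


def reduce_phase(pairs, centroids):
    """Reduce step: replace each centroid with the mean of its assigned points."""
    groups = defaultdict(list)
    for key, p in pairs:
        groups[key].append(p)
    new_centroids = list(centroids)  # centroids with no points stay unchanged
    for key, members in groups.items():
        new_centroids[key] = tuple(sum(c) / len(members) for c in zip(*members))
    return new_centroids


def kmeans(points, centroids, iterations=10):
    """Run a fixed number of map/reduce rounds (assumed stopping rule)."""
    for _ in range(iterations):
        centroids = reduce_phase(map_phase(points, centroids), centroids)
    return centroids


# Hypothetical 2-D feature vectors standing in for encoded queries.
queries = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
result = kmeans(queries, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(result)
```

In a distributed setting the map step would run in parallel over partitions of the query dataset and the reduce step would aggregate per cluster, which is where the reported scalability would come from.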

data mining, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2210.07143

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.05)
South America > Peru > Cusco Department > Cusco Province > Cusco (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(9 more...)

Genre: Research Report > New Finding (1.00)

Industry: Telecommunications (0.46)

Technology:

Information Technology > Databases (1.00)
Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)


Apache Mahout – Fly spaceships with your mind

#artificialintelligence Nov-21-2020, 09:11:20 GMT

apache hadoop, scalable algorithm


Industry:

Government > Military > Air Force (0.40)
Aerospace & Defense (0.40)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.95)


Career: Top 20 Technology Skills in Data Scientist Job Listings - Welcome.AI

#artificialintelligence Jan-26-2019, 16:04:34 GMT

A NoSQL database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. It is a symbolic math library, and is also used for machine learning applications such as neural networks. It has imperative, object-oriented and generic programming features, while also providing facilities for low-level memory manipulation. The technology allows subscribers to have at their disposal a virtual cluster of computers, available all the time. Python has a design philosophy that emphasizes code readability, notably using significant whitespace. It provides constructs that enable clear programming on both small and large scales.

data mining, machine learning, programming language, (12 more...)


Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.36)


The Open Source Roots of Machine Learning

#artificialintelligence May-7-2018, 15:53:34 GMT

The concept of machine learning, which is a subset of artificial intelligence, has been around for some time. Ali Ghodsi, an adjunct professor at UC Berkeley, describes it as "an advanced statistical technique to make predictions on a massive amount of data." Ghodsi has been influential in areas of Big Data, distributed systems, and in machine learning projects including Apache Spark, Apache Hadoop, and Apache Mesos. Here, he shares insight on these projects, various use-cases, and the future of machine learning. There are some commonalities among these three projects that have been influenced by Ghodsi's research.

artificial intelligence, data mining, machine learning, (12 more...)


Industry:

Education > Educational Setting > Higher Education (0.56)
Health & Medicine (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.58)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.52)


51 Big Data Terms You Need to Know - DZone Big Data

@machinelearnbot Sep-27-2017, 22:15:43 GMT

With billions of bytes of data being collected daily, it's more important than ever to understand the intricacies of big data. In an effort to help bring clarity to this field, we created a compiled list from our recent big data guides of what we feel are the most important related terms and definitions you need to know. Any terms you think we should add? Let us know in the comments! Algorithm: A set of rules given to an AI, neural network, or other machine to help it learn on its own; classification, clustering, recommendation, and regression are four of the most popular types.

artificial intelligence, data mining, information, (15 more...)


Industry: Information Technology (0.52)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence (1.00)


277 Data Science Key Terms, Explained

@machinelearnbot Sep-1-2017, 15:00:05 GMT

This post presents a collection of data science related key terms with concise, no-nonsense definitions, organized into 12 distinct topics. Starting with Big Data and progressing through to natural language processing, this definition train has stops at machine learning, databases, Apache Hadoop, and several more. It may take some time, but once you get through the terminology presented herein, you should have a good idea of the key terms of importance in data science. And don't worry if the definitions are too slim for you; links abound for expanded related reading opportunities where appropriate. If somehow you've made it to this website and have not heard the term since it first gained momentum toward becoming a popular term at least a decade and a half ago, I really don't know what to say.

data mining, machine learning, natural language, (14 more...)


Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.78)


Industrial Best Practices of #DataScience in #Healthcare

#artificialintelligence Feb-2-2017, 17:30:08 GMT

The technological framework for healthcare information systems has a new paradigm to handle fast and accelerated medical data coming from disparate sources of data from the holistic healthcare framework of diagnostics tools, DNA mapping, precision medicine, bioinformatics, medical devices, Internet of Medical Things, biopharma, neurology, cardiovascular, drug discovery, and drug development. To surpass the healthcare challenges, increased costs for the individuals, clinical trials, and radiology providers. The majority of the problems stem from the lack of data liquidity and real-time data analytics in healthcare information systems. Healthcare providers adopting big data technologies such as Apache Hadoop can resolve major conundrums with data liquidity (Sears, 2013). Recently, McKinsey and Company released a research report describing new value pathways for the healthcare system that enable the creation of data and make data flows more agile (Sears, 2013).

artificial intelligence, big data, data mining, (15 more...)


Genre:

Research Report > Experimental Study (0.73)
Research Report > New Finding (0.58)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.56)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence (1.00)


What is machine learning?

#artificialintelligence May-12-2016, 08:15:39 GMT

Machine learning is the process of building analytical models to automatically discover previously unknown patterns from data that indicate associations, sequences, anomalies (outliers), classifications, and clusters and segments. These patterns reveal hidden rules as to why an event happened--for example, rules that predict likely customer churn. The widely used Cross Industry Standard Process for Data Mining (CRISP-DM) methodology is used to develop predictive analytical models. CRISP-DM includes six phases: business understanding, data understanding, data preparation, model development using supervised and unsupervised learning, model evaluation and model deployment. The business understanding phase involves defining the business problem or use case, the business objectives and the business questions that need to be answered.

artificial intelligence, data mining, machine learning, (19 more...)


Industry: Information Technology (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.50)
Information Technology > Data Science > Data Mining > Big Data (0.34)
